Cross-lingual syntactic variation over age and gender
نویسندگان
چکیده
Most computational sociolinguistics studies have focused on phonological and lexical variation. We present the first large-scale study of syntactic variation among demographic groups (age and gender) across several languages. We harvest data from online user-review sites and parse it with universal dependencies. We show that several age and gender-specific variations hold across languages, for example that women are more likely to use VP conjunctions.
منابع مشابه
Cross-Lingual Syntactically Informed Distributed Word Representations
We develop a novel cross-lingual word representation model which injects syntactic information through dependencybased contexts into a shared cross-lingual word vector space. The model, termed CLDEPEMB, is based on the following assumptions: (1) dependency relations are largely language-independent, at least for related languages and prominent dependency links such as direct objects, as evidenc...
متن کاملComparison of the Speech Syntactic Features between Hearing-Impaired and Normal Hearing Children
Introduction: The present study seeks to describe and analyze the syntactic features of children with severely hearing loss who had access to the hearing aids compared with children with normal hearing, assigning them to the same separate gender classes. Materials and Methods: In the present study, eight children with severe hearing impairment who used a hearing aid and eight hearing children...
متن کاملFBK: Cross-Lingual Textual Entailment Without Translation
This paper overviews FBK’s participation in the Cross-Lingual Textual Entailment for Content Synchronization task organized within SemEval-2012. Our participation is characterized by using cross-lingual matching features extracted from lexical and semantic phrase tables and dependency relations. The features are used for multi-class and binary classification using SVMs. Using a combination of l...
متن کاملCross-lingual Transfer for Unsupervised Dependency Parsing Without Parallel Data
Cross-lingual transfer has been shown to produce good results for dependency parsing of resource-poor languages. Although this avoids the need for a target language treebank, most approaches have still used large parallel corpora. However, parallel data is scarce for low-resource languages, and we report a new method that does not need parallel data. Our method learns syntactic word embeddings ...
متن کاملVariations of Ethmoid Roof in the Iranian Population: A Cross Sectional Study
Introduction: This study aimed to investigate the distribution of ethmoid roof variation and symmetry according to Keros classification. Materials and Methods: This cross-sectional study assessed the paranasal CT scans of 600 patients over 18 years of age with no history of surgery, trauma, or localized fracture in the ethmoid, nose, and anterior skull...
متن کامل